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HEAVY CHAIN MEMBER OF THE KINESIN FAMILY HUMAN HOMOLOG 
TECHNICAL FIELD 
The present iiivention relates generally to the field of molecular biology and particularly 
to novel polynucleotide and amino acid sequences for a human homolog of the PaclO gene of 
5 Saccharomyces cerevisiae. The present invention provides compositions and methods for the 
diagnosis and treatment of disease states such as X-linked centronuclear myopathy, schizophrenia 
and mental retardation. 

BACKGROUND ART 

Motor proteins in cells include myosin, which is actin-based, and kinesin, dynein and 
10 dynamin, which are microtubule-based (Lillie (1992) Nature 356:358-61). Kinesin has been 
identified as a transporter of membranous organelles in mammalian neurons. Genes related to 
kinesin heavy chain have been identified in Schizosacchamnivr^ f; pombe and Saccharomvces 
cereyisi^e . Most of the members of the kinesin family are implicated in mechanisms of mitosis 
or meiosis (Aizawa (1992) J Cell Biol 1 19: 1287-96). 
15 The PaclO gene of SaschaicmjjfiSS cerevisiae. which encodtesJiehea>^^ 

of the kin^in family, is required for viability in the absence of the kinesin-related CIN8 mitotic 
motor. If deleted with CIN8, an associated family member, the Pac lO/CINS double mutation is 
lethal and presumably prevents separation of the spindle pole bodies (Geiser JR and Hoyt MA, 
unpublished data). 

20 Centronuclear myopathy is an X-linked congenital myopathy (MTMI) that has been 

localized to Xq28 (Isselbacheret al fI994> Harrison's PrinHples n f Internal Medicine McGraw- 
Hill. New York, NY, pg 2387). Dahl et al. (1995; Am J Hum Genet 56:1 108-1 115) report that a 
young girl with a clinically moderate form of myotubular myopathy and associated mental 
retardation was found to cany a cytogenetically detectable deletion in Xq27-q28. Positional 

25 cloning of the MTMI locus has been refined to a 600kb region between the DX5304 and DX5497 
markers. Other diseases linked to the Xp27-q28 locus include schizophrenia, X-linked mental 
retardation associated with Fragile Site FRAXE, and anophthahnos. 

The neonatal form of centronuclear myopathy is X-linked and presents with severe 
hypotonia and weakness at birth. Patients may require respiratory assistance and a feeding tube 

30 due to swallowing difficulties. This form of the disease is often fatal. The early childhood form 
presents without difficulty at birth, but motor milestones, such as walking, running and stair 
climbing, are delayed. In this form the disease may be static or may progress to weakness. A 
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rare form of the disease has an onset in the second or third decade. The early childhood and 
adult forms appear to have autosomal dominant or recessive inheritance patterns. Patients with 
the neonatal form of centronuclear myopathy require careful management for respiratory support 
and gastric feeding, and patients with the early childhood disorder often require ambulatory aids 
5 and orthotic devices and less often wheelchairs. 

Schizophrenic disorders are serious mental illnesses that cause significant social, 
vocational, and personal disability. In the United States there are about 2 million affected 
individuals and these individuals account for an estimated loss of 20 billion dollars of lost 
productivity per year ( Harrison's Principles of Internal Medicine (supra) pg. 2414-1 5). After 
10 Down's syndrome, X chromosome linked mental retardation is the second most common genetic 
cause of mental retardation (McCance et al. PathophvsiolQg y. Mosby, St. Louis, MO, pg. 1420). 
Anopthalmos is a developmental defect characterized by mental retardation and a complete 
absence of the eyes or by the presence of vestigial eyes.. 

In view of the severity of the diseases associated with the Xq27-q28 locus, including 
15 centronuclear myopathy, myotubular myopathy, schizophrenia, X-linked mental retardation 
associated with Fragile Site FRAXE, and anophthalmos, it would be advantageous to provide an 
early and accurate method for the detection of such diseases. It would also be advantageous to 
provide therapeutic compositions and methods for prevention and treatment of such diseases. 

DISCLOSURE OF THE INVENTION 
20 Because the present invention relates to novel nucleotide and amino acid sequences 

disclosed herein for a PACIO human homolog and the nucleotide sequence has been mapped to 
the Xq27-q28 locus, the present invention also relates to the use of the PACIO human homolog in 
the diagnosis, prevention and treatment of diseases associated with the Xq27-q28 locus, such as 
centronuclear myopathy, myotubular myopathy, schizophrenia, X-linked mental retardation 
25 associated with Fragile Site FRAXE and anophthalmos. 

The nucleotide sequence for Incyte clone 41400, encoding the PACIO human homolog, 
was initially foimd among the nucleic acid sequences of a cDNA library made from hybrid cells 
of T-B lymphoblasts from a leukemic cell line. Clone number 41400 was found to have amino 
acid homology to the PaclO gene of Saccharomvces cerevisiae which encodes the heavy chain 
30 members of the kinesin family that are implicated in mechanisms of mitosis or meiosis. Based 
upon the results of a BLAST search (which stands for Basic Local Alignment Search Tool 
(Altschul SF (1993) J Mol Evol 36:290-300; Altschul, SF et al (1990) J Mol Biol 215:403-10)), 
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the PACIO human homolog appears to have a more distant relationship to Ihe beta cardiac 
myosin heavy chain (Qoffitolagus cunicutusi and the neuronal myosin heavy chain f Rattus 
rsUus). The PaclO human homolog also shares some nucleotide sequence homology with the 
serotonin receptors, GenBank accession numbers GI 1 77776 and GI 3643 1 . 
5 The present invention is therefore based on the discovery of a novel cytoskeletal gene 

which maps to the locus Xq27-28 that may be associated with disease states, such as 
cenironuclear myopathy, myotubular myopathy, schizophrenia, X-linked mental retardation 
associated with Fragile Site FRAXE and anophthalmos. The PACIO human homolog and 
nucleotide sequences that encode it and oligonucleotides, peptide nucleic acid (PNA), fragments, 
10 portions or antisense molecules thereof, provide the basis for diagnostic methods for the detection 
and/or quantitation of the PaclO human homolog that is associated with myopathy and X-linked 
mental retardation. For example, the nucleotide sequence disclosed herein, which encodes the 
PACIO human homolog, or fragments thereof, may be used in hybridization assays of biopsied 
cells or tissues or bodily fluids, such as amniotic fluid to detect the nucleic acid which may be 
1 5 associated with such disease states. 

An abnormal level of the PaclO human homolog nucleotide sequences or an abnormal 
transcript size in a biological sample may be characteristic of a regulatory state in which the 
molecules are over-expressed or under-expressed. Nucleotide sequences encoding the PACIO 
human homolog provide the basis for probes which can be used diagnostically to detect 
20 chromosomal aberrations such as deletions, mutations or chromosomal translocations in the gene 
encoding the molecule. Gene expression may be altered in such disease states or there may be a 
chromosomal aberration present in the region of the gene encoding the molecule. 

The present invention also relates, in part, to expression vectors and genetically 
engineered host cells comprising nucleotide sequences encoding the PaclO human homolog for in 
25 vitro or in vivo production of the nucleotide sequences. 

Additionally, the present invention relates to the use of the PACIO human homolog 
polypeptide, or fragment or variant thereof, to produce antibodies and to screen for antagonists or 
inhibitors of the PACIO human homolog which can be used diagnostically to detect and 
quantitate protein levels in disease states. 
30 Peptides or small molecules capable of modulating PAC 1 0 human homolog activity will 

provide the basis for pharmaceutical compositions for the treatment of disease states associated 
writh theXq27-28 locus. 
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The invention further provides diagnostic assays and kits for the detection of the PACIO 
human homolog in cells and tissues comprising the PACIO human homolog which may be used 
as a positive control, and anti-PACl 0 antibodies. Such antibodies may be used in solution-based, 
membrane-based, or tissue-based technologies to detect any disease state or condition related to 
5 the expression of protein or expression of deletions or variants thereof. 

The present invention also provides diagnostic assays and kits that comprise PaclO 
human homolog nucleic acid probes for the detection of mutations, deletions or translocations in 
the PaclO human homolog gene. Such probes can be used in hybridization assays, including 
PGR based techniques, to detect a disease state or condition associated with the PaclO gene. 
10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-D display the polynucleotide (SEQ ID N0:1) and deduced amino acid (SEQ 
ID N0:2) sequence for the PACIO human homolog. 

Figure 2 displays the amino acid alignment of the PACIO human homolog with the 
S^h^romyges cerevisiag PaclO (GenBank 902026) and the SchizosacchaminvreQ pombe 
15 protein SPAC3H8.07c (GenBank 1 177665). Sequences were aligned using the multisequence 
alignment program of DNASTAR software (DNASTAR Inc. Madison WI). 

Figure 3 displays an analysis of the hydrophobicity characteristics of the PACIO human 
homolog based on the predicted amino acid sequence. 

Figure 4 displays the isoelectric point for the human PACIO human homolog as 
20 determined by MacDNAsis. The calculated isoelectric point [pi] for the PACIO human homolog 
is 7.04. 

Figure 5 displays the phylogenetic tree for Incyte Clone 41400, GI 1 177665 and fii 

mm. 

Figure 6 displays the partial amino acid alignment of Incyte Clone 41400 with the 
25 serotonin receptors, GenBank accessions GI 1 77776 and GI 3643 1 . 

Figure 7 illustrates that Incyte Clone 41400 is mapped on chromosome Xq27-q28. 
Figure 8 illustrates the assembly of partial cDNA sequences into a full length 
polynucleotide sequence encoding the PACIO human homolog. 

MODES FOR CARRYING OUT THE INVENTION 
30 The present invention relates to a novel, human cytoskeletal protein, PAC 1 0 human 

homolog which has 41 % amino acid sequence identity to the paclO gene of S. cerevisiae that 
encodes heavy chain member of the kinesin family. Nucleic acid sequences encoding the PACIO 
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human homolog have been mapped to the X chromosome disease locus Xq27-q28 found to be 
associated with X-linked mental retardation, schizophrenia, and congenital myophathies such as 
centronuclear myopathy. 

The present invention relates to the use of the pac 10 human homolog nucleic acid and 
5 PACl 0 human homolog amino acid sequences disclosed herein in the diagnosis and treatment of 
disease states mapped to the Xq27-q28 disease locus. 

The present invention also relates to the use of the pacIO human homolog nucleic acid 
sequences for the detection of mutations, including deletions and translocations in the gene 
encoding the PACIO human homolog. Such nucleic acid sequences could be used for the in 
10 utero fetal diagnosis of diseases such X-linked mental retaidation and X-linked congenital 
myopathies, such as the neonatal form of centronuclear myopathy. 

The present invention also relates to the use of the PACIO human homolog and 
genetically engineered host cells that express the PACl 0 human homolog to evaluate, screen and 
identify substances, compounds or synthetic drugs that modulate the activity of the PACIO 
IS human homolog. 

"Nucleic acid sequence" as used herein refers to an oligonucleotide, nucleotide or 
polynucleotide sequence, and fragments or portions thereof, and to DNA or RNA of genomic or 
synthetic origin which may be double-stranded or single-stranded, whether representing the sense 
or antisense strand. As used herein "amino acid sequence" refers to peptide or protein sequences 
20 or portions thereof. As used herein, lower case "pac 10 human homolog" refers to a nucleic acid 
sequence \vhereas upper case "PACIO human homolog" refers to a protein sequence. As used 
herein, peptide nucleic acid (PNA) refers to a class of informational molecules that have a neutral 
"peptide like" backbone with nucleobases that allow molecules to hybridize to complementary 
DNA or RNA with higher affinity and specificity than corresponding oligonucleotides 
25 (PerSeptive Biosystems 1-800-899-5858). 

As used herein, PACIO encompasses PACIO fix>m any mammalian species, including 
bovine, ovine, murine, porcine, equine and preferably human sources, in nattirally occurring or in 
variant form, or from any source, whether natural, synthetic, semi-synthetic or recombinant. 

As used herein, "naturally occurring" refers to PACIO with an amino acid sequence found 
30 in nature, and "biologically active" refers to PACIO having structural, regulatory or biochemical 
functions of the naturally occurring protein. As used herein, "inmiunological activity" is defined 
as the capability of the natural, recombinant or synthetic PACIO or any oligopeptide thereof, to 
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induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

The term "derivative" as used herein refers to the chemical modification of P AC! 0 human 
homolog. Illustrative of such modifications is replacement of hydrogen by an alkyl, acyl, or 
5 amino group. A PACIO protein derivative retains essential biological characteristics of the 
naturally occurring protein. 

As used herein, the term "purified" refers to molecules, either nucleic or amino acid 
sequences, that are removed from their natural environment and isolated or separated from at 
least one other component with which they are naturally associated. 
10 Thg Cpdme SpguCT^ys 

The nucleotide sequence of the pacIO human homolog (SEQ ID NO: 1) is shown in Figure 
1 . The nucleotide sequence of the pad 0 human homolog was initially found among the 
nucleotide sequences of a cDNA library made from hybrid cells of T-B lymphoblasts fix)m a 
leukemic cell line where it was found two times in 3071 usable nucleotide sequences. Nucleotide 

1 5 sequence encoding a portion of the PACIO human homolog were also found one time in 640" 
usable sequences of a cDNA library made from kidney tissue (Incyte library KIDNNOTOI); one 
time in 1404 usable sequences of a cDNA library made fixjm colon tissue (Incyte library 
COLNNOT09); three times in 5262 usable sequences of a cDNA libi^ made from the hNT 
teratocarcinoma cell line (Incyte library HNT2AGT0I); one time in 2018 usable sequences of a 

20 cDNA library made from the U937 monocyte cell line (Incyte library U937NOT01); one time in 
2214 usable sequences of a cDNA library made from the THP-1 promonocyte cell line treated 
with PMA and LPS (Incyte library THPlPLBOl); one time in 2460 usable sequences of a cDNA 
library made from the THP-l promonocyte cell line treated with PMA and LPS (Incyte library 
THP1PLB02); one time in 2952 usable sequences of a cDNA library made from tonsil tissue of 

25 an individual having lymphoid hyperplasia (Incyte library TONSNOTOI); one time in 3013 
usable sequences of a cDNA library made from prostate tissue (Incyte library PROSNOT06); one 
time in 23 1 9 usable sequences of a cDN A library made from bone marrow (Incyte library 
BMARNOT02); one time in 3742 usable sequences of a cDNA library made from lung tissue 
(Incyte library LUNGNOMOl); one time in 5810 usable sequences of a cDNA library made from 

30 the hNT2 teratocarcinoma cell line (Incyte library HNT2NOT01); one time in 7953 usable 
sequences of a cDNA library made from melanocytes (Incyte library MELANOMOI); and two 
times in 22,260 usable sequences of a cDNA library made from infant brain (BRAINOMOl ). As 
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used herein, the term usuable sequence refers to the total number of clones in a library after the 
removal of vector, nucleotide repeats, contamination, and mitochrondrial DNA. 

The full length cDNA of the paclO human homolog was searched against public protein 
databases using the BLAST algorithm which stands for Basic Local Alignment Search Tool 
5 (Altschul SF (1993) J Mol Evol 36:290-300; Altschul, SF et al (1990) J Mol Biol 215:403-10) 
and was found to have about 50% nucleic acid sequence homology to the Sacchammvces 
ggfgvisiag paclO gene which encodes heavy chain members of the kinesin. A BLAST search 
against Genpept92 revealed that the PAC 10 amino acid sequence has a more distant relationship 
to the beta cardiac myosin heavy chain (Oryfftolagus jaffikulus) and the neuronal myosin heavy 
10 chain ( Rattus rattus ). The PACIO amino acid sequence also has homology to the serotonin 
receptors, Genbank accession numbers GI 177776 and GI 36431. 

The nucleotide sequence of SEQ ID NO: I encodes a PACIO human homolog (SEQ ID 
N0:2) having 201 amino acids, having a predicted molecular weight of 21447.45 Daltons and an 
isoelectric point [pi] of 7.04. 
1 5 Methods for DNA sequencing are well known in the art and employ such enzymes as the 

KJenow fragment of DNA polymerase I, Sequenase® (US Biochemical Corp, Cleveland OH)), 
Taq polymerase (Perkin Ehner, Norwalk CN), thermostable T7 polymerase (Amersham, Chicago 
IL), or combinations of recombinant polymerases and proofieading exonucleases such as the 
ELONGASE Amplification System marketed by Gibco BRL (Gaithersburg MD). Methods to 
20 extend the DNA from an oligonucleotide primer annealed to the DNA template of interest have 
been developed for both smgle-stranded and double-stranded templates. Chain termination 
reaction products were separated using electrophoresis and detected via their incoiporated, 
labeled precursors. Recent improvements in mechanized reaction preparation, sequencing and 
analysis have permitted expansion in the number of sequences that can be determined per day. 
25 Preferably, tiie process is automated witfi machines such as the Hamilton Micix) Lab 2200 

(Hamilton, Reno NV), Peltier Thermal Cycler (PTC200; MJ Research, Watertown MA), the ABI 
Catalyst 800, and ABI 377 and 373 DNA sequencas (Perkin Elmer). 

The quality of any particular cDNA library in which polynucleotides encoding the PACIO 
human homolog are found may be determined by performing a pilot scale analysis of the cDNAs 
30 and checking for percentages of clones containing vector, lambda or £. soli DNA, mitochondrial 
or repetitive DNA, and clones with exact or homologous matches to public databases. 
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The polynucleotide sequence encoding the PACIO human homolog may be extended 
utilizing the nucleotide sequences from SEQ ID N0:1 in various methods known in the art to 
5 detect upstream sequences such as promoters and regulatory elements. Sarkar G et al ( 1 993 ; 
PCR Methods Applic 2:3 1 8-22) disclose "restriction-site polymerase chain reaction (PCR)" as a 
direct method which uses universal primers to retrieve imknown sequence adjacent to a known 
locus. First, genomic DNA is amplified in the presence of primer to a linker sequence and a 
primer specific to the knovm region. The amplified sequences are subjected to a second round of 
1 0 PCR with the same linker primer and another specific primer internal to the first one. Products of 
each round of PCR are transcribed with an appropriate KNA polymerase and sequenced using 
reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using divergent primers based 
on a known region (Triglia T et al(1988) Nucleic Acids Res 16:8186). The primers may be 

1 5 designed using Oligo 4.0 (National Biosciences Inc, Plymouth MN), or another appropriate 
program, to be 22-30 nucleotides in length, to have a GC content of 50% or more, and to aimeal 
to the target sequence at temperatures about 68 ''-72*' C. The method uses several restriction 
enzymes to generate a suitable fragment in the known region of a gene. The fragment is then 
circularized by intramolecular ligation and used as a PCR template. 

20 Capture PCR (Lagerstrom M et al (1991) PCR Methods Applic 1 :l H-19) is a method for 

PCR amplification of DNA fragments adjacent to a known sequence in human and yeast artificial 
chromosome (YAC) DNA. Capture PCR also requuies multiple restriction enzyme digestions 
and ligations to place an engineered double-stranded sequence into an unknovm portion of the 
DNA molecule before PCR. 

25 Parker JD et al (1991; Nucleic Acids Res 19:3055-60), teach walking PCR, a method 

which permits retrieval of unknown sequence. PromoterFinder™ is a new kit available from 
Clontech (Palo Alto CA) which uses PCR, nested primers and special libraries to "walk in" 
genomic DNA. This process avoids the need to screen libraries and is useful in finding 
intron/exon junctions. 

30 Another PCR method. "Improved Method for Obtaining Full Ungth cDNA Sequences " 

by Guegler et al, Patent Application Serial No 08/487,1 12, filed June 7, 1995 and hereby 
incorporated by reference, employs XL-PCR™ enzymes (Perkin-Elmer, Foster City CA) to 
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amplify and/or extend nucleotide sequences. 

Preferred libraries for screening for full length cDNAs are ones that have been size- 
selected to include larger cDNAs. Also, random primed libraries are preferred in that they will 
contain more sequences which contain the 5' and upstream regions of genes. A randomly primed 
5 library may be particularly useful if an oligo d(T) library does not yield a foil-length cDNA. 
Genomic libraries are useful for obtaining introns and extending 5' sequence. 

A new method for analyzing either the size or confirming the nucleotide sequence of 
sequencing or PGR products is c^illary electrophoresis. Systems for rapid sequencing are 
available fiom Perkin Elmer, Beckman Instruments (Fullerton CA), and other companies. 

1 0 Capillary sequencing employs flowable polymers for electrophoretic separation, four different 
fluorescent dyes (one for each nucleotide) which arc laser activated, and detection of the emitted 
wavelengths by a charge coupled devise camera. Output/light intensity is converted to electrical 
signal using appropriate software (eg. Genotyper™ and Sequence Navigator™ programs from 
Perkin Elmer) and the entire process from loading of samples to computer analysis and electronic 

15 data display is computer controlled. Capillary electix)phoresis is particularly suited to the 
sequencing of small pieces of DNA which might be present in limited amounts in a particular 
sample. The reproducible sequencing of up to 350 bp of Ml 3 phage DNA in 30 min has been 
reported (Ruiz-Martinez MC et al (1993) Anal Chem 65:2851-8). 
Expression Systems 

20 In accordance with the present invention, pacl 0 human homolog polynucleotide 

sequences which encode the PACIO human homolog, fragments of the polypeptide, fusion 
proteins or fimctional equivalents thereof, may be used to generate recombinant DNA molecules 
that direct the expression of the paclO human homolog in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the 

25 same or a functionally equivalent amino acid sequence, may be used to clone and express the 
PACIO human homolog. As will be understood by those of skill in the art, it may be 
advantageous to produce PACIO encoding human homolog nucleotide sequences possessing non- 
naturally occurring codons. Codons preferred by a particular prokaryotic or eukaiyotic host 
(Murray E et al (1989) Nuc Acids Res 1 7:477-508) can be selected, for example, to increase the 
30 rate of expression or to produce recombinant RNA transcripts having desirable properties, such 
as a longer half-life, than transcripts produced fiom naturally occurring sequence. 

Also included within the scope of the present invention are polynucleotide sequences that 
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are capable of hybridizing to the nucleotide sequence of SEQ ID N0:1 under conditions of 
intermediate to maximal stringency. Hybridization conditions are based on the melting 
temperature (Tm) of the nucleic acid binding complex, as taught in Berger and Kimmel (1987, 
Guidg \Q Molecular Cloning Tgchniqvies, Methods in Enzymology, Vol 152, Academic Press, 
5 San Diego CA) incorporated herein by reference, and confer a defined "stringency" as explained 
below, 

"Maximum stringency" typically occurs at about Tm-S^'C (5°C below the Tm of the 
probe); "high stringency" at about 5°C to 10°C below Tm; "intermediate stringency" at about 
lO^'C to 20^*0 below Tm; and "low stringency" at about 20*'C to 25°C below Tm, As will be 

10 understood by those of skill in the art, a maximum stringency hybridization can be used to 
identify or detect identical polynucleotide sequences while an intermediate (or low) stringency 
hybridization can be used to identify or detect similar or related polynucleotide sequences. The 
term "hybridization" as used herein shall include "the process by which a strand of nucleic acid 
joins with a complementary strand through base pairing" (Coombs J (1994) Dictionary of 

15 Biotechnology. Stockton Presss, New York NY) The process of amplification often follows the 
process of hybridization. The process of amplification as carried out in polymerase chain 
reaction technologies is described in DiefFenbach CW and GS Dveksler (1995, PGR Primer, a 
h^l^QmXQiy M?m^. Cold Spring Harbor Press, Plainyiew NY) and incorporated herein by 
reference. 

20 As used herein a "deletion" is defined as a change in either nucleotide or amino acid 

sequence in which one or more nucleotides or amino acid residues, respectively, are absent. 

As used herein an "insertion" or "addition" is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino acid residues, 
respectively, as compared to the naturally occurring paclO human homolog. 

25 As used herein "substitution" results from the replacement of one or more nucleotides or 

amino acids by different nucleotides or amino acids, respectively. 

Altered paclO human homolog polynucleotide sequences which may be used in 
accordance with the invention include deletions, insertions or substitutions of different nucleotide 
residues resulting in a polynucleotide that encodes the same or a functionally equivalent PACIO 

30 human homolog. The protein may also show deletions, insertions or substitutions of amino acid 
residues which produce a silent change and result in a functionally equivalent PACIO hiunan 
homolog. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, 
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charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues 
as long as the biological activity of the PACl 0 human homolog is retained. For example, 
negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino 
acids include lysine and arginine; and amino acids with uncharged polar head groups having 
5 similar hydrophilicity values include leucine, isoleucine, valine; glycine, alanine; asparagine, 
glutamine; serine, threonine, phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of the pacIO human 
homolog. As used herein, an "allele" or "allelic sequence" is an alternative form of the pacIO 
human homolog. Alleles result from a mutation, ie, a change in the nucleic acid sequence, and 
10 generally produce altered mRNAs or polypeptides whose structure or function may or may not be 
altered. Any given gene may have none, one or many allelic forms. Common mutational 
changes which give rise to alleles are generally ascribed to deletions, additions or substitutions of 
amino acids. Each of these types of changes may occur alone, or in combination with the others, 
one or more times in a given sequence. 
15 The nucleotide sequences of the present invention may be engineered in order to alter a 

pacIO human homolog coding sequence for a variety of reasons, including but not limited to, 
alterations which modify the cloning, processing and/or expression of the gene product. For 
example, mutations may be introduced using techniques which are well known in the art, eg, 
site-directed mutagenesis to insert new restriction sites, to alter glycosylation patterns or to 
20 change codon preference, for example. 

In another embodiment of the invention, a paclO human homolog natural, modified or 
recombinant sequence may be ligated to a heterologous sequence to encode a fusion protein. For 
example, for screening of peptide libraries for modulators of paclO human homolog activity, it 
may be useful to encode a chimeric PACIO human homolog protein expressing a heterologous 
25 epitope tfiat is recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the paclO nucleotide sequence and the 
heterologous protein sequence, so that the PACIO human homolog protein may be cleaved and 
purified away from the heterologous moiety. 

In an alternate embodiment of the invention, the coding sequence of paclO human 
30 homolog could be synthesized, in whole or in part, using chemical methods well known in the art 
(see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 215-23, Horn T et al(1980) Nuc Acids 
Res Symp Ser 225-32, etc). Alternatively, the protein itself could be produced using chemical 
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methods to synthesize a PACl 0 human homolog amino acid sequence, in whole or in part. For 
example, peptides can be synthesized by solid phase techniques, cleaved from the resin, and 
purified by preparative high performance liquid chromatography (eg, Creighton (1983) Proteins 
Structures And Molecular Principles. WH Freeman and Co, New York NY). The composition of 
5 the synthetic peptides may be confirmed by amino acid analysis or sequencing (eg, the Edman 
degradation procedure; Creighton, supra) 

Direct peptide synthesis can be performed using various solid-phase techniques (Roberge 
JY et al (1995) Science 269:202-204) and automated synthesis may be achieved, for example, 
using the ABI 431 A Peptide Synthesizer (Perkin Elmer) in accordance with the instructions 
10 provided by the manufacturer. Additionally, the amino acid sequence of the PACIO human 
homolog, or any part thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequence from other a subunits, or any part thereof, to produce a variant 
polypeptide. 

Identification of Transformants 

1 5 Although the presence/absence of marker gene expression suggests that the gene of 

interest is also present, its presence and expression should be confirmed. For example, if the 
paclO human homolog is inserted within a marker gene sequence, recombinant cells containing 
the PACIO human homolog insert can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a pad 0 human homolog sequence 

20 under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the PACIO human homolog as well. 

Alternatively, host cells vAiich contain the coding sequence for PACIO human homolog 
and express the protein may be identified by a variety of procedures known to those of skill in the 
art. These procediu^s include, but are not limited to, DNA-DNA or DNA-RNA hybridization 

25 and protein bioassay or immunoassay techniques which include membrane-based, solution-based, 
or chip-based technologies for the detection and/or quantification of the nucleic acid or protein. 

The presence of the paclO human homolog polynucleotide sequence can be detected by 
DNA-DNA or DNA-RNA hybridization or amplification using probes, portions or firagments 
disclosed in SEQ ID N0:1 . Nucleic acid amplification based assays involve the use of 

30 oligonucleotides or oligomers based on SEQ ID NO: 1 to detect transformants containing the 
paclO human homolog nucleic acid. As used herein "oligonucleotides" or "oligomers" refer to a 
nucleic acid sequence of at least about 10 nucleotides and as many as about 60 nucleotides. 
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preferably about 15 to 30 nucleotides, and more preferably about 20-25 nucleotides which can be 
used as a probe or amplimer. Preferably, oligonucleotides are derived from the 3' region of the 
paclO human homolog nucleotide sequence shown in Figure I. 

A variety of protocols for detecting and measuring the expression of a PAC 1 0 homolog 
5 polypeptides, using either polyclonal or monoclonal antibodies specific for the protein are known 
in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA) and fluorescent activated cell sorting (FACS). A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on a 
PACIO homolog polypeptide is preferred, but a competitive binding assay may be employed. 
1 0 These and other assays are described, among other places, in Hampton R et at ( 1 990, Serological 
Mgthpds, a LaboratQiy Manvwl. APS Press, St Paul MN) and Maddox DE et al ( 1983, J Exp Med 
158:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and can be used in various nucleic and amino acid assays. Means for producing labeled 
1 5 hybridization or PGR probes for detecting paclO human homolog polynucleotide sequences 
include oligolabeling, nick translation, end-labeling or PGR amplification using a labeled 
nucleotide. Alternatively, the paclO nucleotide sequence, or any portion of it, may be cloned into 
a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
20 appropriate RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway NJ), Promega (Madison 
WI), and US Biochemical Corp (Cleveland OH) supply commercial kits and protocols for these 
procedures. Suitable reporter molecules or labels include those radionucUdes, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors. 
25 magnetic particles and the like. Patents teaching the use of such labels include US Patents 
3,817,837; 3,850,752; 3.939,350; 3,996,345; 4,277,437; 4,275,149 and 4,366,241. Also, 
recombinant immunoglobulins may be produced as shown in US Patent No. 4,8 16,567 and 
incorporated herein by reference. 
PH rifieatioa of PACIO Human Homoln g 
30 Host cells transformed with paclO human homolog nucleotide sequences may be cultured 

under conditions suitable for the expression and recovery of the encoded protein from cell 
culture. The protein produced by a recombinant cell may be secreted or may be contained 
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intracellularly depending on the sequence and/or the vector used. As will be understood by those 
of skill in the art, expression vectors containing a pac 10 human homolog can be designed with 
signal sequences which directs secretion of the PACIO human homolog through a particular 
prokaryotic or eukaryotic cell membrane. Other recombinant constructions may join the pac 1 0 
5 human homolog polynucleotide sequences to nucleotide sequence encoding a polypeptide 

domain which will facilitate purification of soluble proteins (KroU DJ et al (1993) DNA Cell Biol 
12:441-53; see also above discussion of vectors containing fiision proteins). 

The PACIO human homolog may also be expressed as a recombinant protein with one or 
more additional polypeptide domains added to facilitate protein purification. Such purification 

10 facilitating domains include, but are not limited to, metal chelating peptides such as 

histidine-tryptophan modules that allow purification on immobilized metals (Porath J (1992) 
Protein Expr Purif 3:263-281), protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system 
(Immunex Corp, Seattle WA), The inclusion of a cleavable linker sequence such as Factor XA or 

15 enterokinase (Invitrogen, San Diego CA) between the purification domain and the PACIO human 
homolog is useful to facilitate purification. 

Uses of PACIO and Genetically Engineered Host Cells Containing PACIO. 

The amino acid sequence of the PAC 1 0 human homolog is shown in Figure I . Based 
upon its homology to S. cercvisiae PACIO and the localization of nucleic acid encoding PACIO 
20 to the disease locus Xq27-q28, the human PACIO homolog disclosed herein appears to be a 
cytoskeletal protein that is associated with the disease locus Xq27-q28. 

Accordingly, the present invention provides PACIO human homolog amino acid 
sequences and genetically engineered host cells that express the sequences to evaluate, screen and 
identify substances compounds or synthetic drugs that modulate the activity of the PACIO human 
25 homolog. 

In an embodiment of the present invention, PAC 1 0 human homolog or a variant thereof 
and/or a cell line that expresses the PACIO human homolog or variant thereof may be used to 
screen for antibodies, peptides, or other molecules, such as organic or inorganic molecules made 
by combinatorial chemistry, that act as modulators of the PACIO human homolog activity. Anti- 
30 PACIO antibodies capable of neutralizing the activity of the PACIO human homolog may be 
identified. Synthetic compounds, natural products, and other sources of potentially biologically 
active materials can be screened in a number of ways deemed to be routine to those of skill in the 
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art. 

Antibodies 

Procedures well known in the art may be used for the production of antibodies to PACIO 
human homolog polypeptides. Such antibodies include, but are not limited to, polyclonal, 
5 monoclonal, chimeric, single chain. Fab fragments and fragments produced by a Fab expression 
library. Neutralizing antibodies, ie, those which inhibit biological activity of PACIO human 
homolog polypeptides, are especially preferred for diagnostics and therapeutics. Anti-PACIO 
human homolog antibodies can be used to localize or detect PACIO human homolog in dividing 
cells thereby providing a qualitative measurement of PACIO function. 
10 For the production of antibodies, various hosts including goats, rabbits, rats, mice, etc 

may be immunized by injection with PACIO human homolog polypeptide or any portion, 
fragment or oligopeptide which retains immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase immunological response. Such adjuvants include, but 
are not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active 
15 substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and Corvnehacterinm 
mma are potentially useful human adjuvants which may be employed if purified PACIO 
human homolog polypeptide is administered to individuals. Monoclonal antibodies to PACIO 
human homolog polypeptide may be prepared using any technique which provides for the 
20 production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique originally described by Koehler and Milstein (1975, Nature 
256:495-497), the human B-cell hybridoma technique (Kosbor et al (1983) Immunol Today 4:72; 
Cote et al (1983) Proc Natl Acad Sci 80:2026-2030) and the EBV-hybridoma technique (Cole et 
^ (^^*5) Mpnoc l onai Antibodies and Cancer Ther«py Alan R Liss Inc. pp 77-96). In addition, 
25 techniques developed for the production of "chimeric antibodies", the splicing of mouse antibody 
genes to human antibody genes to obtain a molecule with appropriate antigen specificity and 
biological activity can be used (Morrison et al (1984) Proc Natl Acad Sci 81 :6851-6855; 
Neuberger et al (1984) Nature 312:604-608; Takeda et al (1985) Nature 3 14:452-454). 
Alternatively, techniques described for the production of single chain antibodies (US Patent No. 
30 4,946,778) can be adapted to produce specific single chain antibodies. 

Antibodies may also be produced by inducmg in vivo production in the lymphocyte 
population or by screening recombinant immunoglobulin libraries or panels of highly specific 
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binding reagents as disclosed in Orlandi et al (1989, Proc Natl Acad Sci 86: 3833-3837), and 
Winter G and Milstein C (1991; Nature 349:293-299). 

Antibody fragments which contain specific binding sites for the PAC 10 human homolog 
may also be generated. For example, such fragments include, but are not limited to, the F(ab% 
5 fragments which can be produced by pepsin digestion of the antibody molecule and the Fab 
fragments which can be generated by reducing the disulfide bridges of the F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification 
of monoclonal Fab fragments with the desired specificity (Huse WD et al (1989) Science 
256:1275-1281). 

1 0 PAC 1 0 human homolog-specific antibodies are useful for the diagnosis of conditions and 

diseases associated with expression of PAC 10 human homolog polypeptide. A variety of 
protocols for competitive binding or immunoradiometric assays using either polyclonal or 
monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the formation of a complex between a protein and its specific 

15 antibody and the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies specific for two noninterfering epitopes on a 
specific PAC 10 human homolog protein is preferred, but a competitive binding assay may also 
be employed. These assays are described in Maddox DE et al (1983, J Exp Med 158: 121 1). 
Diagnostic Assays Using PACIO Specific Antibodies 
20 Anti-PAC 1 0 human homolog antibodies are useful for the diagnosis of diseases and 

conditions associated with PACIO human homolog activity, such as, centronuclear myopathy, 
myotubular myopathy, schizophrenia, X-linked mental retardation associated with Fragile Site 
FRAXE and anophtiialmos. Diagnostic assays for the detection of the PACIO human homolog 
include methods utilizing the antibody and a label to detect the PACIO human homolog 
25 polypeptide in human body fluids, cells, tissues or sections or extracts of such tissues. The 
polypeptides and antibodies of the present invention may be used with or without modification. 
Frequently, the polypeptides and antibodies will be labeled by joining them, either covalentiy or 
noncovalently, with a reporter molecule. A wide variety of reporter molecules are known to 
those of skill in the art. 

30 A variety of protocols for measuring a PAC 1 0 human homolog polypeptide, using either 

polyclonal or monoclonal antibodies specific for the respective protein are known in the art 
Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and 
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fluorescent activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on a PACIO human homolog 
polypeptide is preferred, but a competitive binding assay may be employed. These assays are 
described, among other places, in Maddox, DE et al (1983, J Exp Med 158:121 1). 
5 In order to provide a basis for the diagnosis of disease, normal or standard values for the 

PACIO human homolog polypeptide expression are established. This is accomplished by 
combining body fluids or cell extracts taken fix)m normal subjects, either animal or human, with 
antibody to a PACIO human homolog polypeptide under conditions suitable for complex 
formation using techniques which are well known in the art. The amount of standard complex 

10 formation can be quantified by comparing it with a dilution series of positive controls where a 
known amount of antibody is combined with known concentrations of purified PACIO human 
homolog polypeptide. Then, standard values obtained from normal samples may be compared 
with values obtained firom samples from subjects potentially affected by a disorder or disease 
related to PACIO human homolog expression. Deviation between standard and subject values 

1 S establishes the presence of the disease state. 
Drug Screening 

A PACIO human homolog polypeptide, its immunogenic fragments or oligopeptides 
thereof can b e used for screening thera peutic compounds in any of a variety of drug screening 
t echnique s. The fiagment employed in such a test may be free in solution, affixed to a solid 

20 support, home miaxeU^i^a^^ Theabolitionof activity or the 

formation of binding complexes, between a PACIO human homolog polypeptide and the agent 
being tested, may be measured. Accordingly, the present invention provides a method for 
screening a plurality of compounds for ^ecificbind^ with PACIO human homolog or 

a fragment thereof, comprising providing a plurality of compounds; combining a PACIO human 

25 homolog of the present invention or a fragment thereof with each of a plurality of compoimds for 
a time sufficient to allow binding under suitable conditions; and detecting finding of the PACIO 
human homolog, or fragment thereof, to each of the plurality of compounds, thereby identifying 
the compounds which specifically bind the PAC 1 0 human homolog. In such an assay, the 
plurality of compounds may be produced by combinatorial chemistry techniques known to those 

30 ofskillintheart. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to a PACIO human h molog polypeptide and is 
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described in detail in Geysen, European Patent Application 84/03564, published on September 
13, 1984, incorporated herein by reference. In summary, large numbers of different small peptide 
test compounds are synthesized on a solid substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with PACIO human homolog fragments and washed. A 
5 bound PACIO human homolog of the present invention is then detected by methods well known 
in the art. A purified PACIO human homolog can also be coated directly onto plates for use in 
the aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be 
used to capture the peptide and immobilize it on a solid support 

This invention also contemplates the use of competitive drug screening assays in which 
10 neutralizing antibodies capable of binding the PACIO human homolog specifically compete with 
a test compound for binding the PAC 1 0 human homolog. In this manner, the antibodies can be 
used to detect the presence of any peptide which shares one or more antigenic determinants with 
the PAC 1 0 human homolog. 

Uses of the PaclO Human Homolog Polynucleotide Sequences 

1 5 A pac 1 0 human homolog polynucleotide, or any part thereof, provides the basis for 

diagnostic and/or therapeutic compounds. For diagnostic purposes, PACl 0 human homolog 
polynucleotide sequences are used to detect and quantitate gene expression in conditions, 
disorders and diseases in which their activity may be implicated, for example, in centronuclear 
myopathy, myotubular myopathy, schizophrenia, X-linked mental retardation associated with 

20 Fragile Site FRAXE and anophthalmos. For therapeutic purposes, pac 10 homolog antisense 
molecules are administered to individuals in conditions where it is desirable to regulate the Pac 1 0 
homolog gene thereby inhibiting its activity. Alternatively, for therapeutic purposes, sense 
polynucleotide sequences encoding PACIO human homolog or sequences thereof are 
administered to individuals in conditions where it is desirable to replace a naturally occurring 

25 mutated polynucleotide sequence, i.e. a pac 10 human homolog polynucleotide sequence that has 
been subject to a translocation, deletion, or point mutation, or to augment a naturally occurring 
polynucleotide sequence. 

Included in the scope of the invention are oligonucleotide sequences, antisense RNA and 
DNA molecules and ribozymes, wiiich function to destabilize pac 10 human homolog mRNA or 

30 inhibit translation of a pac 10 htmian homolog. 

Another aspect of the subject invention is to provide for nucleic acid hybridization or 
PCR probes which can detect polynucleotide sequences, including genomic sequences, encoding 



-18- 



wo 97/43413 PCT/US97/08092 

PACIO human homolog, or closely related molecules, such as alleles. In one embodiment of the 
present invention, paclO human homolog polynucleotide sequences are used as probes in 
diagnostic assays for the detection of mutations, deletions and mutations in the naturally 
occurring polynucleotide sequences. 
5 Diagnostic Uses of PaclO Polynucleotide 

A PACIO human homolog encoding polynucleotide sequence may be used for the 
diagnosis of diseases associated with the disease locus Xq27-q28. For example, polynucleotide 
sequences encoding PACIO human homolog can be used in hybridization or PGR assays of 
tissues from biopsies or autopsies or biological fluids, such as serum, amniotic fluid, or muscle 
10 biopsy. Such qualitative or quantitative methods include Southern or northern analysis, dot blot 
or other membrane-based technologies; PGR technologies; dip stick, pin or chip technologies; 
and ELISA or other multiple sample format technologies. AH of these techniques are well known 
in the art and are in fact the basis of many commercially available diagnostic kits. Specific pad 0 
human homolog polynucleotide probes can be designed for use in hybridization based diagnostic 
1 5 assays to ascertain if a naturally occurring pad 0 human homolog polynucleotide sequence 
contains a nucleic acid mutation or deletion or has been subject to a translocation. In particular, 
such assays could be used to detect the presence of X-linked congenital abnormalites, such as 
neonatal centronuclear myopathy, in fetal nucleic acid derived ftom anmiotic or chorionic villus 
samples. Early detection of congenital defects, such as centronuclear myopathy, alerts physicians 
20 and health care professionals and provides the opportunity for genetic counseling and/or early 
ther^eutic intervention. 

Hybridization assays can be tailored to evaluate the efficacy of a particular therapeutic 
treatment regime and are used in animal studies, in clinical trials, or in monitoring the treaunent 
of an individual patient For disease diagnosis, a normal or standard profile for expression is first 
25 established. This is accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with paclO homolog polynucleotide sequences or a portion 
thereof, under conditions suitable for hybridization or amplification. Standard hybridization is 
quantified by comparing the values obtained for normal subjects with a dilution series of positive 
controls run in the same experiment where a known amount of a purified pad 0 polynucleotide 
30 sequence is used. Standard values obtained fix)m normal samples are compared with values 
obtained from samples ftom subjects potentially affected by a disorder or disease related to paclO 
human homolog expression. Deviation between standard and subject values establishes the 
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presence of the disease state. If disease is established, an existing therapeutic agent or therapy 
plan is administered, and treatment profile or values may be generated. Finally, the assay may 
be repeated on a regular basis to evaluate progress towaid the normal or standard pattern. 
Successive treatment profiles may be used to show the efficacy of treatment over a period of 
5 several days or several months. 

PCR as described in US Patent Nos. 4,683,195; 4,800,195; and 4,965,188 provides 
additional uses for oligonucleotides based upon the pacIO human homolog polynucleotide 
sequence. Such oligomers are generally chemically synthesized, but they are generated 
enzymatically or produced from a recombinant source. Oligomers generally comprise two 
10 nucleotide sequences, one with sense orientation (5'->3*) and one with antisense (3*<-5') 

employed under optimized conditions for identification of a specific gene or condition. The same 
two oligomers, nested sets of oligomers, or even a degenerate pool of oligomers are employed 
under less stringent conditions for detection and/or quantitation of closely related DN A or RNA 
sequences. 

15 Additionally methods to quantitate the expression of a particular molecule include 

radiolabeling (Melby PC et all993 J Immunol Methods 159:235-44) or biotinylating (Duplaa C 
et al 1993 Anal Biochem 229-36) nucleotides, coamplification of a control nucleic acid, and 
standard curves onto which the experimental results are interpolated. Quantitation of multiple 
samples is speeded up by running the assay in an ELISA format where the oligomer of interest is 

20 supplied in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

Therapeutic Uses of a PaclO Human Homolog Polynucleotide 

A paclO human homolog antisense molecule can provide the basis for treatment of 
various abnormal conditions associated with the disease locus Xq27-q28 where it would be 

25 desirable to downregulate the presence of naturally occurring paclO human homolog or a mutated 
paclO human homolog, thereby inhibiting its activity. Alternatively, polynucleotide sequences 
encoding PACIO human homolog may provide the basis for gene therapy in the treatment of 
various abnormal conditions where it is desirable to introduce the pad 0 gene or up-rcgulate the 
presence of PACl 0 human homolog thereby enhancing its activity, for example in conditions 

3D where the naturally occurring sequence has been mutated. 

Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or 
from various bacterial plasmids, are used for delivery of recombinant paclO human homolog 
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sense or antisense molecules to the targeted cell population. Methods which are well known to 
those skilled in the art can be used to construct recombinant vectors containing paclO human 
homolog-related nucleotide sequences. See, for example, the techniques described in Maniatis et 
al (supra) and Ausubel et al(supra). Alternatively, recombinant paclO human homolog can be 
S delivered to target cells in liposomes. 

The full length cDNA sequence and/or its regulatory elements enable researchers to use 
paciO human homolog nucleic acid as a tool in sense (Youssoufian H and HF Lodish 1993 Mol 
Cell Biol 13:98-104) and antisense (Eguchi et al (1991) Annu Rev Biochem 60:631-652) 
investigations of gene function. Oligonucleotides, designed from the cDNA or conu-ol sequences 
10 obtained from the genomic DNA can be used in vitro or in vivo to inhibit expression. Such 
technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed fix>m various locations along the coding or control regions. 

Additionally, paclO human homolog expression can be modulated by transfecting a cell 
or tissue with expression vectors which express high levels of a paclO human homolog fragment 
15 in conditions where it would be preferable to inhibit its activity. Such constructs can flood cells 
with untranslatable sense or antisense sequences which compete for binding with the naturally 
occurring PACIO human homolog. Even in the absence of integration into the DNA, such 
vectors may continue to transcribe RNA molecules until all copies of the vector are disabled by 
endogenous nucleases. Such transient expression may last for a month or more with a non- 
20 replicating vector (Mettler I, personal communication) and even longer if appropriate replication 
elements are part of the vector system. 

Stable transformation of appropriate germ line cells, or preferably a zygote, with a vector 
containing pad 0 human homolog sense or antisense nucleic acid, or fragments thereof, may 
produce a transgenic organism that produces enough copies of the sense or antisense sequence to 
25 significantly modify the activity of the endogenous pac 1 0 human homolog nucleic acid. A pac 1 0 
human homolog sense nucleic acid fragment can be introduced into an organism when it is 
desirable to augment the presence of the endogenous pac 10 human homolog or to overcome the 
presence of an endogenous form of the pac 10 human homolog that is aberrant, ie subject to a 
mutation, deletion or translocation. A paclO antisense nucleic acid fragment can be introduced 
30 into an organism when it is desirable to eliminate the activity of the endogenous paclO nucleic 
acid. 

Modifications of gene expression can be obtained by designing antisense sequences to the 



-21- 



wo 97/43413 



PCT/US97/08a92 



control regions of the paclO human homolog gene, such as the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, eg, between -10 and +10 regions 
of the leader sequence, are preferred. Antisense RNA and DNA molecules may also be designed 
to block translation of mRNA by preventing the transcript from binding to ribosoraes. Similarly, 
5 inhibition can be achieved using Hogeboom base-pairing methodology, also known as "triple 
helix" base pairing. Triple helix pairing compromises the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the 
10 ribozyme molecule to complementary target RNA, followed by a endonucleolytic cleavage. 
Within the scope of the invention are engineered hammerhead motif ribozyme molecules that 
specifically and efficiently catalyze endonucleolytic cleavage of pad 0 human homolog RNA 
sequences. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified 

15 by scanning the target molecule for ribozyme cleavage sites which include the follov/ing 

sequences, GUA, GUU and GUC. Once identified, short RNA sequences of between 1 5 and 20 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may be 
evaluated for secondaxy sUiictural features which may render the oligonucleotide sequence 
inoperable. The suitability of candidate targets may also be evaluated by testing accessibility to 

20 hybridization with complementary oligonucleotides using ribonuclease protection assays. 

Both antisense RNA and DNA molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of RNA molecules. These include 
techniques for chemically synthesizing oligonucleotides such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro or in vivo 

25 transcription ofDNA sequences encoding the antisense RNA molecule. Such DNA sequences 
may be incorporated into a wide variety of vectors with suitable RNA polymerase promoters such 
as T7 or SP6. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly can be introduced into cell lines, cells or tissues. 

DNA molecules may be modified to increase intracellular stability and half-life. Possible 

30 modifications include, but are not limited to, the addition of flanking sequences of the 5' and/or 3' 
ends of the molecule or the use of phosphorothioate or 2* 0-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. 
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Methods for introducing vectors into cells or tissue include those methods discussed infia. 
In addition, several of these transformation or transfection methods are equally suitable for sx 
vivo therapy. 

Furthermore, the paclO human homolog polynucleotide sequences disclosed herein may 
5 be used in molecular biology techniques that have not yet been developed, provided the new 
techniques rely on properties of nucleotide sequences that are currently known, including but not 
limited to such properties as the triplet genetic code and specific base pair interactions. 
DetecHon and Mapping of PaclO Human Homolog Polynucleotide Sequences 

The nucleic acid sequence for the paclO human homolog can also be used to generate 
10 hybridization probes as previously described, for mapping the endogenous genomic sequence. 
The sequence may be mapped to a particular chromosome or to a specific region of the 
chromosome using well known techniques. These include in sily hybridization to chromosomal 
spreads (Verma et al ( 1 988) Human Chromosome^;: A Man. ,al of Basic Technig n^s Pergamon 
Press, New York City), flow-sorted chromosomal preparations, or artificial chromosome 
15 constructions such as YACs, bacterial artificial chromosomes (BACs), bacterial PI constructions 
or single chromosome cDNA libraries. 

In situ hybridization of chromosomal preparations and physical mapping techniques such 
as linkage analysis using established chromosomal markers are invaluable in extending genetic 
maps. Examples of genetic maps can be found in Science (1995; 270:410f and 1994; 265:1981f). 
20 Often the placement of a gene on the chromosome of another mammalian species may reveal 
associated markers even if the number or arm of a particular human chromosome is not known. 
New sequences can be assigned to chromosomal arms, or parts thereof, by physical mapping. 
This provides valuable information to investigators searching for disease genes using positional 
cloning or other gene discovery techniques. Once a disease or syndrome, sucH as myotubular 
25 myopathy localized by genetic linkage to a particular genomic region, for example, to Xq27-q28 
(Dahl et al. 1 995 Am. J. Hum. Genet. 56: 1 108-11 1 5), any additional sequences mapping to that 
area may represent associated or regulatory genes for further investigation. The nucleotide 
sequence of die subject invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc between normal, carrier or affected individuals. 
30 Pharmaceutical Compositions 

The present invention relates to pharmaceutical compositions which may comj^se 
nucleotides, proteins, antibodies, antagonists, or inhibitors, or agonists of PACIO human 
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homolog in combination with at least one other agent, such as stabilizing compound, which may 
be administered in any sterile, biocompatible pharmaceutical carrier, including, but not limited to, 
saline, buffered saline, dextrose, and water. Any of these molecules can be administered to a 
patient alone, or in combination with other agents, drugs or hormones, in pharmaceutical 
5 compositions where it is mixed with excipient(s) or pharmaceutically acceptable carriers. In one 
embodiment of the present invention, the pharmaceutically acceptable carrier is pharmaceutically 
inert. 

Adminht rarton of Pharmaceutical Compositions 

Administration of pharmaceutical compositions is accomplished orally or parenterally. 
10 Methods of parenteral delivery include topical, intra-arterial (directly to the tumor), 
intramuscular, subcutaneous, intramedullary, intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. In addition to the active ingredients, these 
pharmaceutical compositions may contain suitable pharmaceutically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds into 
1 5 preparations which can be used pharmaceutically. Further details on techniques for formulation 
and administration may be found in the latest edition of "Remington's Pharmaceutical Sciences" 
(Maack Publishing Co, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
20 administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of active 
compounds with solid excipient, optionally grinding a resulting mixture, and processing the 

25 mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. 
Suitable excipients are carbohydrate or protein fillers such as sugars, including lactose, sucrose, 
mannitol, or sorbitol; starch from com, wheat, rice, potato, or other plants; cellulose such as 
methyl cellulose, hydroxypropylmethyl-ccllulose, or sodium carboxymethylcellulose; and gums 
including arable and tragacanth; and proteins such as gelatin and collagen. If desired, 

30 disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl 
pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as concentrated sugar solutions. 



-24. 



wo 97/43413 PCTA;S97/08092 

which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product 
identification or to characterize the quantity of active compound, ie, dosage. 
5 Pharmaceutical preparations which can be used orally include push-fit capsules made of 

gelatin, as well as soft, sealed c^sules made of gelatin and a coating such as glycerol or sorbitol 
Push-fit capsules can contain active ingredients mixed with a filler or binders such as lactose or 
starches, lubricants such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
10 oils, liquid paraffin, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
active compounds. For injection, the pharaiaceutical compositions of the invention may be 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks's solution, Ringer's solution, or physiologically buffered saline. Aqueous injection 
15 suspensions may contain substances which increase the viscosity of the suspension, such as 
sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as 
ethyl oleate or triglycerides, or liposomes. Optionally, the suspension may also contain suitable 
20 stabilizers or agents which increase the solubility of the compounds to allow for the preparation 
of highly concentrated solutions. 

For topical or nasal administration, penetrants £q)propriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 
Manufacture and Sforagc 

25 The pharmaceutical compositions of the present invention may be manufactured in a 

manner that known in the art, eg, by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, 

30 etc. Salts tend to be more soluble in aqueous or therprotonic solvents that are the 

corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder in lmM-50mM histidine, 0.1%-2% sucrose, 2%-7% mannitol at a pH range of 4.5 to 5.5 
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that is combined with buffer prior to use. 

After pharmaceutical compositions comprising a compound of the invention formulated 
in a acceptable carrier have been prepared, they can be placed in an appropriate container and 
labeled for treatment of an indicated condition. For administration of pharmaceutical 
5 compositions, such labeling would include amoimt, frequency and method of administration. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an eflFective amount to achieve the 
intended purpose. The determination of an effective dose is well within the capability of those 
10 skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, eg, of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or 
pigs. The animal model is also used to achieve a desirable concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 

15 administration in humans. 

A therapeutically effective dose refers to that amount of protein or its antibodies, 
antagonists, or inhibitors or agonists which ameliorate the symptoms or condition. Therapeutic 
efficacy and toxicity of such compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, eg, ED50 (the dose therapeutically effective 

20 in 50% of the population) and LD50 (the dose lethal to 50% of the population). The dose ratio 
between therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, 
ED50/LD50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 
The data obtained from cell culture assays and animal studies is used in formulating a range of 
dosage for human use. The dosage of such compounds lies preferably within a range of 

25 circulating concentrations that include the ED50 with little or no toxicity. The dosage varies 
within this range depending upon the dosage form employed, sensitivity of the patient, and the 
route of administration. 

The exact dosage is chosen by the individual physician in view of the patient to be 
treated. Dosage and administration are adjusted to provide sufficient levels of the active moiety 

30 or to maintain the desired effect. Additional factors which may be taken into account include the 
severity of the disease state, eg, tumor size and location; age, weight and gender of the patient; 
diet, time and frequency of administration, drug combination(s), reaction sensitivities, and 
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tolerance/response to therapy. Long acting pharmaceutical compositions might be administered 
every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate 
of the particular formulation. 

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of 
5 about 1 g, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature. See US Patent Nos. 4,657,760; 5^06,344; or 
5,225,212. Those skilled in the art will employ different formulations for nucleotides than for 
proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides will be 
specific to particular cells, conditions or locations, for example. 
10 It is contemplated that antagonists or agonists of PAC 1 0 human homolog can be 

delivered in a suitable formulation to individuals having conditions where it is desirable to inhibit 
or enhance, respectively, PAC 10 human homolog activity. 

These examples are provided by way of illustration and are not included for the purpose 
of limiting the invention. 

15 INDUSTRIAL APPLICABILITY 

I Construction of the TBLYNOTOI Library 

The TBLYNOPTOl library was constructed from the 174 x CEM.TI hybrid T + B 
lymphoblast cell line. The 174 x CEM.TL cell line is a cloned somatic cell hybrid of a human T 
lymphoblastoid cell line, CEM*.3, and a human B lymphoblastoid cell line, 721.174, which was 

20 produced by PEG-mediation fiision as described in Salter et al. (1985) Immunogenetics 21:235- 
246. The hybrid cell line expresses class II antigens encoded by CEM'^.3, HLA class I antigens 
encoded by both parental cell lines, and HLA-B5 and HLA-Bw6 antigens encoded by parental 
cell lines 721 .174 and CEM'^.S, respectively. Studies of this cell line indicated that two 
complementary trans-acting factors mediate enhanced expression of HLA class I antigens in the 

25 hybrid, one encoded by a gene on chromosome 6 of CEM'*.3, and the other encoded by the 
genome of 721.174 previously postulated to induce expression of CEM".3-encoded class I 
anUgcns in hybrids of CEM".3 with B lymphoblastoid cell lines (Salter et al. (1985)). 

Poly(A+) RNA (mRNA) was purified from the 1 74 x CEM.TL cell line. The RNA was 
primed with oligo d(T), and cDNA was synthesized from the mRNA. Synthetic adaptor 

30 oligonucleotides were ligated onto cDNA ends enabling their insertion into Uni-ZAP™ XR 
vector system (Stratagene), allowing high efficiency unidirectional (sense orientation) lambda 
library construction and the convenience of a plasmid system with blue/white color selection to 
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detect clones with cDNA insertions. Alternative unidirectional vectors include but are not 
limited to pcDNAI (Invitrogcn, San Diego, CA) and pSHlox-1 (Novagen, Madison, WI). 

The 174 X CEM.TL cell line lambda cDNA library (Catalog No. 937214, Stratagene) can 
be screened with either DNA probes or antibody probes and the pBluescript® phagemid 
5 (Stratagene) can be rapidly excised in vivo . The phagemid allows the use of a plasmid system for 
easy insert characterization, sequencing, site-directed mutagenesis, the creation of unidirectional 
deletions and expression of fusion proteins. The library phage particles were infected into E. coli 
host strain XL 1 -Blue® (Stratagene), which has a high transformation efficiency, increasing the 
probability of obtaining rare, under-represented clones in the cDNA library. 

10 The phagemid forms of individual cDNA clones were obtained by the in vivo excision 

process^ in which the host bacterial strain was coinfected with both the lambda library phage and 
an fl helper phage. Proteins derived from both the library-containing phage and the helper phage 
nicked the lambda DNA, initiated new DNA synthesis from defined sequences on the lambda 
target DNA and created a smaller, single stranded circular phagemid DNA molecule that 

IS included all DNA sequences of the pBluescript® plasmid and the cDNA insert. The phagemid 
DNA was secreted from the cells and purified, then used to re-infect fresh host cells, where the 
double stranded phagemid DNA was produced. Because the phagemid carries the gene for P- 
lactamase, the newly-transformed bacteria are selected on medium containing ampicillin. 

Phagemid DNA was purified using the QIAwell-8 Plasmid, QIAwell PLUS, or QlAwell 

20 ULTRA DNA Purification System (QIAGEN Inc. Chatsworth, CA). This product line provides a 
convenient, rapid and reliable high-throughput method to lyse bacterial cells and isolate highly 
purified phagemid DNA using QIAGEN anion-exchange resin particles with EMPORE™ 
membrane technology from 3M in a multiwell format. The DNA was eluted fix>m the 
purification resin already prepared for DNA sequencing and other analytical manipulations. 

25 The cDNA inserts from random isolates of the 174 x CEM.TL cell line library was 

sequenced in part by the method of Sanger F and AR Coulson (1975; J Mol Biol 94:441 f), using 
a Hamilton Micro Lab 2200 (Hamilton, Reno NV) in combination with four Peltier Thermal 
Cyclers (PTC200 from MJ Research, Watertown MA) and Applied Biosystems 377 or 373 DNA 
Sequencing Systems (Perkin Elmer), and the reading frame was determined. 

30 II Homology Searching of cDNA Clones and Their Deduced Pr teins 

Each cDN A was compared to sequences in GenBank using a BLAST search (Basic Local 
Alignment Search Tool; Altschul SF (1993) J. Mol. Evol. 36: 290-300; Altschul SF et al (1990) 
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J. Mol. Biol. 21 5:403-410). This method identified Incyte Clone 41400 as a non-exact match to 
the paclO gene of S- cerevjsiac (GenBank U29137) as well as a partial match to the serotonin 
receptors having Genbank accession numbers GI 177776 and GI 36431. 

BLAST was used to search for local sequence alignments . BLAST produces alignments 
5 of both nucleotide and amino acid sequences to determine sequence similarity. Because of the 
local nature of the alignments, BLAST is especially useful in determining exact matches and in 
identifying homologs. BLAST is useful for matches which do not contain gaps. The fundamental 
unit of BLAST algorithm output is the High-scoring Segment Pair (HSP). 

An HSP consists of two sequence fragments of arbitrary but equal lengths whose 
10 alignment is locally maximal and for which the alignment score meets or exceeds a threshold or 
cutoff score set by the user. The BLAST approach is to look for HSPs between a query sequence 
and a database sequence, to evaluate the statistical significance of any matches found, and to 
report only those matches which satisfy the user-selected threshold of significance. The 
parameter E establishes the statistically significant threshold for reporting database sequence 
15 matches. E is interpreted as the upper bound of the expected fixquency of chance occurrence of 
an HSP (or set of HSPs) within the context of the entire database search. Any database sequence 
whose match satisfies E is reported in the program output. 
Ill Determination of Reading Frame of cDNA Clone 

The reading frame of individual cDNA clones obtained from the TBL YNOTO 1 library 
20 was obtained by analyzing the polynucleotide sequences for the presence of start (ATG, GTG, 
etc.) and stop codons (TGA, TAA, TAG). Typically, one fi^e will continue throughout the 
major portion of all of a cDNA sequence and the other two pending firames tend to contain 
numerous stop codons. Algorithms for determining reading flame have been developed which 
analyze the occurrence of individual nucleotide bases of each putative codon triplet (Fickett, JW 
25 (1982) NucI Acids Res 10:5303). Coding DNA tends to contain predominantly certain 

nucleotides within certain triplet periodicities, such as a significant preference for pyrimidines in 
the third codon position. These algorithms have been incorporated into widely available software 
and are used to determine coding potential (and fiame) of a given stretch of DNA. This 
algorithm-derived information, combined with start/stop codon information, was used to 
30 determine proper fi^ame of individual clones within the library with a high degree of certainty, 
thus permitting the correct reading frame alignment with appropriate expression vehicles. 
IV Extensi n of PACIO Human H molog 
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The nucleic acid sequence of pac 10 human homolog is used to design oligonucleotide 
primers for obtaining full length sequences from genomic libraries. One primer is synthesized to 
initiate extension in the antisense direction (XLR) and the other is synthesized to extend sequence 
in the sense direction (XLF). The primers allow the known pac 10 human homolog sequence to 
5 be extended "outward" generating amplicons containing new, unknown nucleotide sequence for 
the control region of interest. The initial primers are designed from the cDNA using Oligo 4.0 
(National Biosciences Inc, Plymouth MN), or another appropriate program, to be 22-30 
nucleotides in length, to have a GC content of 50% or more, and to armeal to the target sequence 
at temperatures about 68°-72° C. Any stretch of nucleotides which would result in hairpin 

10 structures and primer-primer dimerizations is avoided. 

A human genomic library is used to extend and amplify 5' upstream sequence. If 
necessary, a second set of primers is designed to further extend the known region. By following 
the instructions for the XL-PCR kit (Perkin Elmer) and thoroughly mixing the enzyme and 
reaction mix, high fidelity amplification is obtained. Beginning with 40 pmol of each primer and 

1 5 the recommended concentrations of all other components of the kit, PGR is performed using the 
Peltier Thermal Cycler (PTC200; MJ Research, Watertown MA) and the following parameters: 



Step 1 


94° C for I min (initial denaturation) 


Step 2 


65° Cfor 1 min 


Step 3 


68° C for 6 min 


Step 4 


94° C for 15 sec 


Steps 


65° C for 1 min 


Step 6 


68° Cfor 7 min 


Step? 


Repeat step 4-6 for 1 5 additional cycles 


Steps 


94° C for 15 sec 


Step 9 


65° Cfor 1 min 


Step 10 


68° Cfor 7:15 min 


Step 1 1 


Repeat step 8- 10 for 12 cycles 


Step 12 


72° C for 8 min 


Step 13 


4° C (and holding) 



30 A 5-10 /il aliquot of the reaction mbcture is analyzed by electrophoresis on a low 

concentration (about 0*6-0.8%) agarose mini-gel to determine which reactions were successful in 
extending the sequence. The largest products or bands are selected and cut out of the gel. 
Further purification involves using a commercial gel extraction method such as QIAQuick™ 
(QIAGEN Inc). After recovery of the DNA, KJenow enzyme is used to trim single-stranded, 

35 nucleotide overhangs creating blunt ends which facilitate religation and cloning. 

After ethanol precipitation, the products are redissolved in 13 a^I of ligation buffer, 1/il 
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T4-DNA ligase (15 units) and T4 polynucleotide kinase are added, and the mixture is 
incubated at room temperature for 2-3 hours or overnight at 1 6° C. Competent cqU cells (in 
40 fx\ of appropriate media) are transformed with 3 fxl of ligation mixture and cultured in 80 fxl of 
SOC medium (Sambrook J et al, supra). After incubation for one hour at 37° C, the whole 
5 transformation mixture is plated on Luria Bertani (LB>agar (Sambrook J et al, supra) containing 
2x Carb. The following day, several colonies are randomly picked from each plate and cultured 
in 150 fxl of liquid LB/2xCarb medium placed in an individual well of an appropriate, 
commercially-available, sterile 96-well microtiter plate. The following day, 5 a<I of each 
overnight culture is transferred into a non-sterile 96-well plate and after dilution 1:10 with water, 
10 5 Ail of each sample is transferred into a PCR array. 

For PCR amplification, 18 a<1 of concentrated PCR reaction mix (3.3x) containing 4 units 
of rTth DNA polymerase, a vector primer and one or both of the gene specific primers used for 
the extension reaction are added to each well. Amplification is performed using the following 
conditions: 

15 Stepl 94 °C for 60 sec 

Step 2 94° C for 20 sec 

Step 3 55" C for 30 sec 

Step 4 72" C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

20 Step 6 72° C for 180 sec 

Step 7 4" C (and holding) 

Aliquots of the PCR reactions are run on agarose gels together writh molecular weight 
markers. The sizes of the PCR products are compared to the original partial cDNAs, and 
^propriate clones are selected, ligated into plasmid and sequenced. 
25 V Labelingof Hybridization Probes 

Hybridization probes derived from SEQ ID N0:1 are employed to screen cDNAs, 
mRNAs or genomic DNAs. Although the labeling of oligonucleotides, consisting of about 20 
base-pairs, is specifically described, essentially the same procedure is used with larger cDNA 
fragments. Oligonucleotides are labeled by combining 50 pmol of each oligomer and 250 mCi of 
30 [Y-*^P] adenosine triphosphate (Amersham, Chicago IL) and T4 polynucleotide kinase (DuPont 
NEN* Boston MA). The labeled oligonucleotides are purified with Sephadex G-25 super fine 
resin column (Pharmacia). A portion contauung 10' counts per minute of each is used in a typical 
membrane based hybridization analysis of human genomic DNA digested with one of the 
following endonucleases (Ase I, Bgl II, EcoR I, Pst I, Xba I, or Pvu II; DuPont NEN®). 
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The DNA from each digest is fractionated on a 0.7 percent agarose gel and transferred to 
nylon membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out 
for 1 6 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0. 1 x saline sodium citrate and 0.5% 
5 sodium dodecyl sulfate. After XOMAT AR™ film (Kodak, Rochester NY) is exposed to the 
blots in a Phosphoimager cassette (Molecular Dynamics, Sunnyvale CA) for several hours, 
hybridization patterns are compared visually. 

VI Antisense Molecules 

The paclO human homolog sequence, or any part thereof, is used to inhibit in vivo or in 
10 vitro expression of endogenous paclO human homolog, respectively. Although use of antisense 
oligonucleotides, consisting of about 20 base-pairs, is specifically described, essentially the same 
procedure is used with larger cDNA fragments. An oligonucleotide based on the coding 
sequence of the paclO human homolog is used to inhibit expression of endogenous pad 0 human 
homolog. Using Oligo 4.0, the complementary oligonucleotide is designed from the conserved 5' 
1 5 sequence and used to inhibit either transcription, by preventing promoter binding to the upstream 
nontranslated sequence, or translation of a pacIO human homolog transcript by preventing the 
ribosome from binding to the mRNA. 

VII Production of Antibodies 

For production of polyclonal antibodies, the deduced amino acid sequence of PACIO 
20 human homolog is analyzed using DNASTAR software (DNASTAR Inc) to determine regions of 
high immunogenicity and a corresponding oligopeptide is synthesized and used to raise 
antibodies in rabbits. Analysis to select appropriate epitopes, such as those near the C-terminus 
or in adjacent hydrophilic regions, is described by Ausubel FM et al (supra). An oligopeptide of 
about 15 residues in length is synthesized using an ABI Peptide Synthesizer Model 431 A (Perkin 
25 Elmer, Norwalk, CN) using fmoc-chemistry, and coupled to keyhole limpet hemocyanin (KLH, 
Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS; Ausubel FM 
et al, supra). Rabbits are inimunized with the oligopeptide-KLH complex in complete Freund's 
adjuvant. The resulting antisera are tested for antipeptide activity, for example, by binding the 
peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting 
30 with radtoiodinated, goat anti-rabbit IgG. 

VIII Purification f PACIO Human Horn log Using Specific Antib dies 
Endogenous or recombinant PACIO human homolog can be purified by immunoaffmity 
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chromatography using antibodies specific for PACIO human homolog. An immunoaffinity 
column is constructed by covalently coupling PACIO human homolog specific antibody to an 
activated chromatographic resin such as CnBr-activated Sepharose (Pharmacia Biotech). After 
the coupling, the resin is blocked and washed according to the manufacturer's instructions. 
5 Media containing PACIO human homolog is passed over the immunoaffinity column, and 

the column is washed under conditions that allow the preferential absorbance of PACIO human 
homolog (eg, high ionic strength buffers in the presence of detergent). The column is eluted 
under conditions that disrupt antibody/protein binding (eg, a buffer of pH 2-3 or a high 
concentration of a chaotrope such as urea or thiocyanate ion), and PACIO human homolog is 
10 collected. 

IX Identification of Molecules Which Interact with PACIO Human Homolog 

PACIO human homolog, or a biologically active fragment thereof, is labeled with '^^I 
Bolton-Hunter reagent (Bolton, AE and Hunter, WM (1973) Biochem J 133: 529). Candidate 
small molecules previously arrayed in the wells of a 96 well plate are incubated with the labeled 
15 PACIO human homolog, washed and any wells with labeled PACIO human homolog complex 
are assayed. Data obtained using different concentrations of PACl 0 human homolog are used to 
calculate values for the number, affinity, and association of the protein with the candidate 
molecules. 

X. Mapping of the PaclO Human Homolog to the Xq27-q28 Locus 

20 The pad 0 human homolog nucleotide sequence was mapped to the X chromosome and 

localized to the Xq27-q28 locus by FISH in situ hybridization methods. 
Materials and Methods for the FISH in situ Method 
Slide preparaHnn 

Lymphocytes isolated fix)m human blood were cultured in minimal essential medium 
25 (MEM) supplemented with 10% fetal calf serum, and phytohemagglutinin (PHA) at 37 degrees C 
for 68-72 hours. The lymphocyte cultures were treated with BrdU (0.18mg/ml Sigma) to 
synchronize the cell population. The synchronized cells were washed three times with serum- 
free medium to release the block and recultured at 37 degrees C for 6 hours in MEM with 
thymidine, 2.5 g/ml (Sigma). Cells were harvested and slides were made by using standard 
30 procedures including hypotonic treatment, fixed and air-dried. 
In situ Hybridisation and FISH Detection 

cDNA probes were biotinylated with dATP using the BRL BioNick labeling kit at 1 5 
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degrees C for 2hr (Heng et al. (1992) Proc Natl Acad Sci 89:9509-13; Heng et al. (1994) Meth 
Mol Biol: In situ hybridization protocols . Humana Press, Clifton, NJ, pp. 35-49). 

The procedure for FISH detection was performed according to Heng et aL(1992, supra) 
and Heng et al. (1993) Chromosoma 102:325-332. Slides were baked at 55 degrees C for 1 hour. 
5 After RNase treatment, the slides were denatured in 70% formamide in 2xSSC for 2 minutes at 
70 degrees C followed by dehydration with ethanol. Probes were denatured at 75 degrees C for 5 
minutes in a hybridization mix consisting of 50% formamide and 10% dextran sulphate. Probes 
were loaded on die denatured slides. After overnight hybridization, slides were washed in 50% 
formamide in 2xSSC followed by washes in 2xSSC and detected as well as amplified using 

10 published methods (Heng et al, 1994, supra V For fluorescence detection, slides are incubated in 
3% BSA in 4xSSC prior to incubation with Avidin-FITC. The FISH signals and the DAPI 
banding pattern was recorded separately by taking photographs, and the assignment of the FISH 
mapping data with chromosomal bands was achieved by superimposing FISH signals with DAPI 
banded chromosomes (Heng et al, 1993, supra) . 

1 5 Under the conditions used, the hybridization efficiency for clone 4 1400 was 40%. Clone 

41400 was mapped to the X chromosome and localized to the locus Xq27-q28. Figure 7 
illustrates that incyte clone number 41400 is mapped on chromosome Xq27-q28. 
XI. Assembly of Pad 0 Nucleotide Sequences 

As illustrated in Figure 8, partial paclO human homolog cDNA sequences were 

20 assembled using the AutoAssembler™ 1 .4.0 from Applied Biosystems (ABI). The partial clones 
used in the assembly were Incyte Clones 40735 and 41400 (Incyte library TBLYNOTOl); 12844 
and 155634 (Incyte library THPlPLBOl made from a THP-1 promonocyte cell line treated with 
PMA and LPS); 826058 (Incyte library PROSNOT06 made from normal prostate tissue); 134937 
(Incyte library BMARNOT02 made fh>m bone marrow); 490767, 263915 and 262518 (Incyte 

25 library HNT2AGT01 made from the hNT-2 cell line, post mitotic neurons); 495844 (Incyte 
library HNT2NOT01 made from the hNT-2 cell line); 000632 (Incyte library U937NOT01 made 
fix)m the U937 monocyte cell line); 737713 (Incyte library TONSNOTOl made from tonsil tissue 
from an individual with lymphoid hyperplasia of the tonsils); 908587 (Incyte library 
COLNNOT09 made from colon tissue); and 1 15551 (Incyte library KIDNNOTOl made firom 

30 kidney tissue). Differences in nucleotide sequences, such as those occurring at nucleotide 
position 599 of Figure 8, may reflect inter-individual genetic variation. 

All publications and patents mentioned in the above specification are herein incorporated 
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by reference. Various modifications and variations of the described methods and system of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit 
of the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
5 to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INF0Rr4ATI0N 
(i) APPLICANT: INCYTE PHARMACEUTICALS, INC. 

(ii) TITLE OF THE INVENTION: PACIO HUMAN HOMOLOG 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: U.S. 

(F) ZIP: 94304 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) PCT APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/648,736 

(B) FILING DATE: 16-MAY-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Billings, LucyJ. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/ DOCKET NUMBER; PF-0073 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1314 base pairs 
{B} TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: TBLYNOTOl 

(B) CLONE: 41400 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TNCCCAAGNA GNCCATNCCT TTGGACAACN AGCGCAAATN ATTTACAGCA TCAAAATGGT 60 

TGCAAATCCT GGNCAAATGG GNCTGCAAAA CAGCAGCTTT GTTGGAAGCC TTNTTTTNTA 120 

ATTCCATGGC ATCAGAAGAC CCTTTAATGA GGCANATNAG TTCATCAANN ANAATNCCCC 180 

TGGTGGGCAN AAGACTAACT NAGTNACCAT TGCACCCAGA CGCCTGGGTG CATCACAGCA 240 
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TTNCTGCATC AAAAACATCA ATGTNATNGT TNCATNCCCC TGTGAGTNAA AAATNACACT 300 

CACACATGTT TTGTCACTGT NATTNCTGNA ANTNAAAACA TGCNGCNAAA ANAAAAAAAA 360 

AAAAAAAAAA ACGCAGGGGN CGTNACCATG CGCKCTCGCA TCCCCAAGAT GGCGGCCGTT 4 20 

AAGGACAGTT GTGGCAAAGG AGAAATGGCC ACAGGGAATG GGCGGCGGCT CCACCTGGGG 4 80 

ATTCCTGAGG CCGTGTTTGT GGAAGATGTA GATTCCTTCA TGAAACAGCC TGGGAATGAG 54 0 

ACTGCAGATA CAGTATTAAA GAAGCTGGAT GAACAGTACC AGAAGTATAA GTTTATGGAA 600 

CTCAACCTTG CTCAAAAGAA AAGAAGGCTA AAAGGTCAGA TTCCTGAAAT TAAACAGACT 660 

TTGGAAATTC TAAAATACAT GCAGAAGAAA AAAGAGTCCA CCAACTCAAT GGAGACCAGA 720 

TTCTTGCTGG CAGATAACCT GTATTGCAAA GCTTCAGTTC CTCCTACCGA TAAAGTGTGT 780 

CTGTGGTTGG GGGCTAATGT AATGCTTGAA TATGATATTG ATGAAGCTCA GGCATTGTTG 840 

GAAAAGAATT TATCGACTGC CACAAAGAAT CTTGATTCCC TGGAGGAAGA CCTTGACTTT 900 

CTTCGAGATC AATTTACTAC CACAGAAGTC AATATGGCCA GGGTTTATAA TTGGGATGTA 960 

AAAAGAAGAA ACAAGGATGA CTCTACCAAG AACAAAGCAT AATGCTGGCA ATTAAAAATG 1020 

TGGTTTAGTT TTCCAAACAT GTTATCTTAA ATACCCCTTT ATCCTTACAG GTTGACATAA 1080 

CTTTGAATGT TTTAACAGCA AGAATTTTAA GAAAAGATAA ACACCATTTT ATTTATTTAT 1140 

AAAAACAAAA TTAGTTTCAA ATATTTTTGA CATTGTGATT TTTTTTTCCA CATTTCTCAG 1200 

CAAAGCTAAT GGTATTTTAA TCATTATTTT TGCCTGTCAT AAGGAAACTC TTAGCTGAAA 12 60 

TGGCCGNAAA CTGTGNGNCA TGCTATGGAA GCTGAATGNC GGACGNTAGC ACAG 1314 

(2) INFORMATION FOR SEQ ID N0:2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: TBLYNOTOl 

(B) CLONE: 41400 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 2: 

Met Ala Thr Gly Asn Gly Arg Arg Leu His Leu Gly He Pro Glu Ala 

15 10 15 

Val Phe Val Glu Asp Val Asp Ser Phe Met Lys Gin Pro Gly Asn Glu 

20 25 30 

Thr Ala Asp Thr Val Leu Lys Lys Leu Asp Glu Gin Tyr Gin Lys Tvr 

33 40 45 

Lys Phe Met Glu Leu Asn Leu Ala Gin Lys Lys Arg Arg Leu Lys Gly 

50 55 60 

Gin He Pro Glu He Lys Gin Thr Leu Glu He Leu Lys Tyr Met Gin 

, 75 80 

Lys Lys Lys Glu Ser Thr Asn Ser Met Glu Thr Arg Phe Leu Leu Ala 

85 90 95 

Asp Asn Leu Tyr Cys Lys Ala Ser Val Pro Pro Thr Asp Lys Val Cvs 

100 105 110 

Leu Trp Leu Gly Ala Asn Val Met Leu Glu Tyr Asp He Asp Glu Ala 

115 120 125 

Gin Ala Leu Leu Glu Lys Asn Leu Ser Thr Ala Thr Lys Asn Leu Asp 

130 135 140 

Ser Leu Glu Glu Asp Leu Asp Phe Leu Arg Asp Gin Phe Thr Thr Thr 

150 155 160 

Glu Val Asn Met Ala Arg Val Tyr Asn Trp Asp Val Lys Arg Arg Asn 

165 170 
Lys Asp Asp Ser Thr Lys Asn Lys Ala 
180 185 
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CLAIMS 

1 . A purified polynucleotide comprising a nucleic acid sequence encoding tlie polypeptide 
having the sequence as depicted in SEQ ID N0:2. 

2. The polynucleotide of Claim 1 wherein the polynucleotide sequence comprises SEQ ID 
5 NO:I. 

3. An antisense molecule comprising the complement of the polynucleotide of Claim 2 or a 
portion thereof 

4. An expression vector comprising the polynucleotide of Claim 1 . 

5. A host cell transformed with the expression vector of Claim 4. 

10 6. A diagnostic composition for the detection of pac 1 0 human homolog polynucleotide 
sequences comprising the polynucleotide of Claim 2, or a fragment thereof 

7. A purified polypeptide comprising the amino acid sequence as depicted in SEQ ID NO:2. 

8. An antibody specific for the polypeptide of Claim 7. 

9. A method for producing a PAC 1 0 human homolog polypeptide having the amino acid 
1 5 sequence as depicted in SEQ ID NO:2, said method comprising the steps of 

a) culturing the host cell of Claim 5 under conditions suitable for the expression 
of said polypeptide; and 

b) recovering said polypeptide from the host cell culture. 

1 0. A method of screening a plurality of compounds for specific binding affinity with the 
20 polypeptide of Claim 7 or a portion thereof comprising the steps of: 

a) providing a plurality of compounds; 

b) combining the polypeptide of Claim 7 with each of a plurality of compounds 
for a time sufficient to allow binding under suitable conditions; and 

c) detecting binding of said polypeptide of Claim 7 to each of the plurality of 
25 compounds, thereby identifying the compounds which specifically bind said polypeptide of 

Claim 7. 

11. A diagnostic composition for the identification of PAC 1 0 human homolog polypeptide 
sequences comprising the antibody of Claim 8. 

12. A method for inhibiting the expression of PACIO human homolog in a cell comprising 
3D administering an effective amount of the antisense molecule of Claim 3 to said cell. 

13. A diagnostic test for the detection of pac 1 0 human homolog nucleic acid sequences 
encoding PACl 0 human homolog in a biological sample, comprising the steps of : 

-38- 
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a) combining the biological sample with a polynucleotide which comprising the 
nucleic acid sequence of SEQ ID NO: 1 , or a fragment thereof, under conditions suitable for the 
formation of a nucleic acid hybridization complex between the nucleic acid of SEQ ID NO: I and 
a complementary nucleic acid sequence in said sample, 
5 b) detecting said hybridization complex, and 

c) comparing the amount of said hybridization complex with a standard wherein 
the presence of an abnormal level of said hybridization complex correlates positively with a 
condition associated with an Xq27-q28 linked congenital abnormality. 
14. A diagnostic test for the detection of pad 0 human homolog nucleic acid sequences 
10 encoding PACIO human homolog in a biological sample, comprising the steps of: 

a) combining the biological sample with polymerase chain reaction primers under 
conditions suitable for nucleic acid amplification, wherein said primers comprise fragments of 
nucleic acid sequence of SEQ ID NO: 1 , 

b) detecting amplified nucleic acid sequences, and 

1 5 c) comparing the amount of amplified nucleic acid sequences in said biological 

sample with a standard thereby determining whether the amount of said nucleotide sequence 
varies from said standard, wherein the presence of an abnormal level of said nucleotide sequence 
correlates positively with a condition associated with an Xq27-q28 linked congenital 
abnomiality. 

20 
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